Abstract of JP2001109766
PROBLEM TO BE SOLVED: To provide a document retrieving device capable of efficiently
searching for a desired document.
SOLUTION: A retrieval request character string consisting of a retrieval condition, a sort
condition, and a range specification for the goodness of fit to the retrieval condition is input
from a retrieval request inputting means 107. Retrieving means 108 and 109 retrieve the
documents meeting the retrieval condition, goodness of fit calculating means 110 and 111
calculate the goodness of fit of each document, and a sort information acquiring means 112
acquires sort information. A retrieval result cutting means 113 eliminates any document whose
goodness of fit is not within the specified range. A retrieval result rearranging means 114 first
rearranges the documents according to the sort information and then, where the sort
information is the same, in order of the goodness of fit. Documents can thus be displayed
while eliminating those whose goodness of fit is not included in the range designated by a user.
* NOTICES *

JPO and INPIT are not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer. So the translation may not reflect the
original precisely.

2. **** shows the word which can not be translated.
3. In the drawings, any words are not translated.



CLAIMS 

[Claim(s)] 

[Claim 1]A document retrieval system which searches accumulated document data according
to a search condition, and rearranges and displays search results according to a sort condition,
comprising:

A retrieval-request input means which inputs a retrieval-request character string comprising a
search condition, a sort condition, and a range specification of goodness of fit, the goodness of
fit showing the degree to which a document corresponds to said search condition.

A search means which searches for documents which fulfill said search condition.

A goodness-of-fit calculating means which computes said goodness of fit of each document
found by said search means.

A sorting information acquisition means which acquires, for each found document, sorting
information for rearrangement according to said sort condition.

A search-results cut-off means which excludes, from the found documents, any document
whose goodness of fit does not fall within said specified goodness-of-fit range.

A search-results rearrangement means which rearranges the documents within said
goodness-of-fit range output from said search-results cut-off means first by said sorting
information and, when said sorting information is the same, in order of said goodness of fit.

A search-results displaying means which displays the search results rearranged by said
search-results rearrangement means.

[Claim 2]A document retrieval system which searches accumulated document data according
to a search condition, and rearranges and displays search results according to a sort condition,
comprising:

A retrieval-request input means which inputs a retrieval-request character string comprising a
search condition, a sort condition, and a range specification of goodness of fit, the goodness of
fit showing the degree to which a document corresponds to said search condition.

A search means which searches for documents which fulfill said search condition.

A goodness-of-fit calculating means which computes said goodness of fit of each document
found by said search means.

A sorting information acquisition means which acquires, for each found document, sorting
information for rearrangement according to said sort condition.

A search-results division means which compares the goodness of fit of each found document
with said specified goodness-of-fit range and gives each document a division flag which shows
whether its goodness of fit falls within said range.

A search-results rearrangement means which rearranges the flagged documents output from
said search-results division means first by said division flag, then, when the value of said
division flag is the same, by said sorting information, and, when said sorting information is the
same, in order of said goodness of fit.

A search-results displaying means which displays the search results rearranged by said
search-results rearrangement means.

[Claim 3]The document retrieval system according to claim 1 or 2, wherein said search means,
while searching for documents corresponding to said search condition, computes the frequency
of occurrence of each search term in each document, and said goodness-of-fit calculating
means computes said goodness of fit of each document based on the frequency of occurrence
of the search terms computed by said search means.

[Claim 4]The document retrieval system according to claim 1 or 2, wherein said search means,
while searching for documents corresponding to said search condition, computes the number of
documents in which each search term appears and the frequency of occurrence of each search
term in each document, and said goodness-of-fit calculating means computes said goodness of
fit of each document based on the frequency of occurrence of the search terms in each
document and the number of documents in which the search terms appear.

[Claim 5]The document retrieval system according to claim 1 or 2, wherein said goodness-of-fit
calculating means has the following, outputs the relative goodness of fit expressed by said
relative value as the goodness of fit of each document, and said retrieval-request input means
performs said range specification of goodness of fit in terms of said relative goodness of fit.

An absolute goodness-of-fit calculation means which calculates the goodness of fit of each
document.

A relative goodness-of-fit calculation means which converts the goodness of fit of each
document calculated by said absolute goodness-of-fit calculation means into a value relative to
the highest goodness of fit among them.

[Claim 6]A document retrieval method which searches accumulated document data according
to a search condition, and rearranges and displays search results according to a sort condition,
wherein, in response to a retrieval request which specifies a search condition, a sort condition,
and a range of goodness of fit which shows the degree of correspondence to said search
condition, documents which fulfill said search condition are searched for in the accumulated
document data and said goodness of fit of each found document is computed; sorting
information of each document for rearranging the documents according to said sort condition is
acquired; documents whose goodness of fit does not fall within the range specified by said
retrieval request are removed from the search results; and the documents remaining in the
search results are rearranged first by said sorting information and, when said sorting
information is the same, in order of said goodness of fit, and are displayed.

[Claim 7]A document retrieval method which searches accumulated document data according
to a search condition, and rearranges and displays search results according to a sort condition,
wherein, in response to a retrieval request which specifies a search condition, a sort condition,
and a range of goodness of fit which shows the degree of correspondence to said search
condition, documents which fulfill said search condition are searched for in the accumulated
document data and said goodness of fit of each found document is computed; documents
whose goodness of fit does not fall within the range specified by said retrieval request are
removed from the search results; sorting information for rearranging the documents remaining
in the search results according to said sort condition is acquired; and said documents are
rearranged first by said sorting information and, when said sorting information is the same, in
order of said goodness of fit, and are displayed.

[Claim 8]A document retrieval method which searches accumulated document data according
to a search condition, and rearranges and displays search results according to a sort condition,
wherein, in response to a retrieval request which specifies a search condition, a sort condition,
and a range of goodness of fit which shows the degree of correspondence to said search
condition, documents which fulfill said search condition are searched for in the accumulated
document data and said goodness of fit of each found document is computed; sorting
information of each document for rearranging the documents according to said sort condition is
acquired; the goodness of fit of each document is compared with the range of goodness of fit
specified by said retrieval request and a division flag which shows whether it falls within said
range is given to each document; and the documents are rearranged first by said division flag,
then by said sorting information when the value of said division flag is the same, and in order
of said goodness of fit when said sorting information is the same, and are displayed.

[Claim 9]The document retrieval method according to claim 6, 7, or 8, wherein, as said
goodness of fit of each found document, a relative goodness of fit with respect to the highest
goodness of fit among said documents is computed, and the range of goodness of fit can be
specified in said retrieval request in terms of said relative goodness of fit.



DETAILED DESCRIPTION

[Detailed Description of the Invention] 
[0001] 

[Field of the Invention]This invention relates to a document retrieval system and a document
retrieval method which search for a desired document according to a search condition, and in
particular to ones which rearrange and display search results by a combination of the degree to
which each document agrees with the search condition and the bibliographic items which
accompany each document, for example, for newspaper articles, newest date first.
[0002] 

[Description of the Prior Art]In recent years, document ranking techniques have attracted
attention: the goodness of fit between a document and a search condition is determined based
on, for example, the frequency of occurrence of the search terms in the document, and results
are rearranged and displayed in descending order of goodness of fit. Flexible searches have
also been realized in which bibliographic items which accompany a document are specified as a
sort condition. For newspaper articles, for example, the date is specified as the sort condition
so that reports are displayed newest date first, while reports of the same date are displayed in
descending order of goodness of fit with the search condition.

[0003]As shown in drawing 13, the conventional document retrieval system has the following.

The word frequency information extraction means 1303 which extracts, from the document
data 1301 of the newspaper articles used as a retrieval object, the word frequency information
on the words which appear in the dictionary 1302, and stores it in the word frequency index
1304.
The bibliographic-items extraction means 1305 which takes out the information on bibliographic
items, such as a date and a paper name code, from the document data 1301 and stores it in the
bibliographic-items index 1306.
The retrieval-request input means 1307 with which a user inputs the retrieval-request character
string which consists of a search condition and a sort condition.
The word frequency calculation means 1308 which consults the word frequency index 1304
and determines the frequency of occurrence, within each document, of the search terms
contained in the search condition.
The logic operation means 1309 which performs logical operations between record sets.
The goodness-of-fit calculation means 1310 which computes the goodness of fit between the
search condition and each record.
The sorting information acquisition means 1311 which acquires the bibliographic information for
rearrangement specified as the sort condition.
The result rearrangement means 1312 which rearranges the record list of search results by
bibliographic information and goodness of fit.
The result display means 1313 which displays search results.

[0004]The frequency of occurrence of each dictionary word in the text used as a retrieval object
is stored in the word frequency index 1304 by the extraction operation of the word frequency
information extraction means 1303.

[0005] Drawing 14 is a flow chart which shows the procedure of the search in the conventional
document retrieval system. The document data 1301 comprises two or more records
(documents) separated by a record delimiter, and each record comprises two or more fields
separated by a field delimiter. Drawing 3 shows an example of document data: the field
delimiter is "^F", the record delimiter is "^R", and the data is newspaper article data which
comprises the three fields called a paper name code, a date, and the body of story.
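
The record and field layout described above can be sketched as follows. This is only an illustration: it assumes the delimiters are the literal two-character strings "^F" and "^R" and that the fields appear in the order paper name code, date, body of story. The names `Record` and `parse_document_data` and the sample text are hypothetical and are not taken from drawing 3.

```python
from typing import NamedTuple

FIELD_DELIMITER = "^F"   # assumed literal form of the field delimiter
RECORD_DELIMITER = "^R"  # assumed literal form of the record delimiter


class Record(NamedTuple):
    paper_code: str
    date: str
    body: str


def parse_document_data(data: str) -> list[Record]:
    """Split raw document data into records, then each record into three fields."""
    records = []
    for raw in data.split(RECORD_DELIMITER):
        if not raw.strip():
            continue  # skip the empty piece after a trailing record delimiter
        paper_code, date, body = raw.split(FIELD_DELIMITER, 2)
        records.append(Record(paper_code.strip(), date.strip(), body.strip()))
    return records


# Hypothetical sample data, invented for this sketch.
sample = "01^F19990105^FMatsushita announces a new product^R02^F19990106^FMarket report^R"
records = parse_document_data(sample)
```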
[0006]The word frequency information extraction means 1303 scans the document data 1301
beforehand, counts how many times each word registered in the dictionary 1302 appears in the
body-of-story field of each record, and stores the counts in the word frequency index 1304
together with the number of records in which the word appears and the total record count.
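
A minimal sketch of such an index follows. It assumes whitespace tokenisation, which would not suit Japanese text and is used here only to keep the example short; the function name `build_word_frequency_index` and the sample bodies are hypothetical.

```python
from collections import Counter, defaultdict


def build_word_frequency_index(bodies, dictionary):
    """Return (postings, total_records).

    postings maps each dictionary word to {record_number: occurrences};
    the number of records containing a word is len(postings[word]).
    """
    postings = defaultdict(dict)
    for record_number, body in enumerate(bodies):
        counts = Counter(body.lower().split())  # naive tokenisation (assumption)
        for word in dictionary:
            if counts[word] > 0:
                postings[word][record_number] = counts[word]
    return dict(postings), len(bodies)


bodies = ["matsushita ships new product", "new product new price", "weather report"]
postings, total = build_word_frequency_index(bodies, {"matsushita", "product", "new"})
```

With postings of this shape, a logical AND of two search terms could be taken as the intersection of their record numbers, for example `set(postings["matsushita"]) & set(postings["product"])`.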

[0007]The bibliographic-items extraction means 1305 scans the document data 1301 
beforehand, and stores the contents of the bibliographic-items field of each record in the 
bibliographic-items index 1306. 

[0008]First, Step 1401: A user inputs a retrieval-request character string with the retrieval-
request input means 1307. A retrieval-request character string consists of two portions, a
search condition and a sort condition. Drawing 15 shows an example of the retrieval-request
character string. The portion "Matsushita AND new product" is the search condition; it means
searching for reports whose body of story contains both of the two search terms "Matsushita"
and "new product". The portion "@HIDUKE @SHIMEI" is the sort condition; it means arranging
search results newest date first and arranging reports of the same date in ascending order of
paper name code. Reports for which both the date and the paper name code are the same are
arranged in order of goodness of fit.

[0009]Step 1402: For every search term contained in the search condition input by the
retrieval-request input means 1307, the word frequency calculation means 1308 (Step 1403)
refers to the word frequency index 1304 and obtains the number of records in whose body of
story the word appears, the internal number of each such record, the frequency of occurrence
of the word in each record, and the total record count.

[0010]Step 1404: The logic operation means 1309 performs the logical operation on the record
sets which the word frequency calculation means 1308 outputted.

[0011]Step 1405: For all the search-results records which the logic operation means 1309
outputted, (Step 1406) the goodness-of-fit calculation means 1310 computes the goodness of
fit (Rel) of each record with the search condition, for example by Equation 1.

Rel = \sum_i (TF_i \cdot IDF_i)
IDF_i = 1 - \log_2(DF_i / ND)    (Equation 1)

Here TF_i is the frequency of occurrence of the search term W_i in the record, DF_i is the
number of records in which the word W_i appears, and ND is the total record count.

[0012]The method of calculating goodness of fit is not restricted to Equation 1.
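
As a worked illustration of this calculation, the sketch below computes Rel as the sum over search terms of TF times IDF, with IDF = 1 - log2(DF / ND). The function names and the numbers are hypothetical and chosen so the arithmetic is easy to follow.

```python
import math


def idf(df: int, nd: int) -> float:
    """IDF_i = 1 - log2(DF_i / ND)."""
    return 1.0 - math.log2(df / nd)


def goodness_of_fit(tf_by_term: dict, df_by_term: dict, nd: int) -> float:
    """Rel = sum over the search terms of TF_i * IDF_i for one record."""
    return sum(tf * idf(df_by_term[term], nd) for term, tf in tf_by_term.items())


# Hypothetical numbers: 8 records in total; "matsushita" appears in 2 of them
# (IDF = 1 - log2(2/8) = 3) and "product" in 4 of them (IDF = 1 - log2(4/8) = 2).
rel = goodness_of_fit({"matsushita": 3, "product": 1}, {"matsushita": 2, "product": 4}, nd=8)
# rel is 3 * 3 + 1 * 2 = 11.0
```

A term that appears in every record has IDF 1, so rarer terms weigh more heavily in the sum.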
[0013]Step 1407: With reference to the bibliographic-items index 1306, the sorting information
acquisition means 1311 acquires, as sorting information for each record which the goodness-
of-fit calculation means 1310 outputted, the values of the bibliographic items corresponding to
the sort condition inputted from the retrieval-request input means 1307.
[0014] Drawing 6 shows an example of the contents of the output of the sorting information
acquisition means 1311, in which the values of the date and the paper name code are acquired
as sorting information.

[0015]Step 1408: The result rearrangement means 1312 rearranges and outputs the output of
the sorting information acquisition means 1311, using the two or more bibliographic items
acquired as sorting information as sort keys. Records for which the values of all the
bibliographic items are the same are arranged in descending order of goodness of fit.
[0016]Drawing 16 shows an example of the contents of the output of the result rearrangement
means 1312.

[0017]Step 1409: The result display means 1313 formats the output of the result
rearrangement means 1312 and shows it to a user.

[0018] 

[Problem(s) to be Solved by the Invention]However, in the conventional composition, the values
of the bibliographic items specified as the sort condition are given priority over goodness of fit
as keys of rearrangement. A document with low goodness of fit may therefore be ranked high
while a document with high goodness of fit is ranked low, and there was a problem that a
desired document could not be found efficiently.
[0019]For example, the document (record internal number 10) ranked lowest in drawing 8 is
such a case.
[0020]This invention solves this technical problem of the conventional technology. It aims at
providing a document retrieval system and a document retrieval method with which a desired
document can be found efficiently, while the values of the bibliographic items specified as the
sort condition are still given priority as keys of rearrangement, by letting the user limit the range
of goodness of fit and either removing from the result the documents which do not fall within
the specified goodness-of-fit range or ranking them lower.

[0021] 

[Means for Solving the Problem]So, the document retrieval system of this invention is provided
with a retrieval-request input means which inputs, as a retrieval-request character string, a
goodness-of-fit range specification in addition to a search condition and a sort condition, and a
search-results cut-off means which excludes from the search results any document whose
goodness of fit does not fall within the specified goodness-of-fit range.

[0022]Alternatively, it is provided with a retrieval-request input means which inputs, as a
retrieval-request character string, a goodness-of-fit range specification in addition to a search
condition and a sort condition, and an intermediate result division means which gives each
document a different division flag depending on whether its goodness of fit falls within the
specified goodness-of-fit range.
[0023]In the document retrieval method of this invention, a retrieval request which specifies a
search condition, a sort condition, and a range of goodness of fit is received. Documents which
fulfill the search condition are searched for in the accumulated document data, and the
goodness of fit of each found document is computed. The sorting information of each
document is acquired, and documents whose goodness of fit does not fall within the range
specified by the retrieval request are removed from the search results. The documents which
remain in the search results are rearranged first by sorting information and, when the sorting
information is the same, in order of goodness of fit, and are displayed.

[0024]Alternatively, the procedure which acquires the sorting information of each document and
the procedure which removes from the search results the documents whose goodness of fit
does not fall within the range specified by the retrieval request are interchanged.

[0025]Alternatively, a retrieval request which specifies a search condition, a sort condition, and
a range of goodness of fit which shows the degree of correspondence to the search condition
is received. Documents which fulfill the search condition are searched for in the accumulated
document data, and the goodness of fit of each found document is computed. The sorting
information of each document for rearranging the documents according to the sort condition is
acquired, the goodness of fit of each document is compared with the range of goodness of fit
specified by the retrieval request, and a division flag which shows whether it falls within the
range is given to each document. The documents are rearranged first by the division flag, then
by sorting information when the value of the division flag is the same, and in order of goodness
of fit when the sorting information is the same, and are displayed.

[0026]Therefore, documents whose goodness of fit falls outside the range which the user
specified can be removed from the search results or ranked lower. This avoids the problem
that, when a sort condition is specified, a document with low goodness of fit is ranked high and
a document with high goodness of fit is ranked low.
[0027] 

[Embodiment of the Invention]Hereafter, embodiments of the invention are described with
reference to the figures.

[0028](A 1st embodiment) Drawing 1 is a block diagram showing the composition of the
document retrieval system in a 1st embodiment of this invention.
[0029]This device is provided with the following.

Like the conventional device (drawing 13), the word frequency information extraction means
103 which extracts, from the document data 101 of the newspaper articles used as a retrieval
object, the word frequency information on the words which appear in the dictionary 102 and
stores it in the word frequency index 104; the bibliographic-items extraction means 105 which
takes out the information on bibliographic items, such as a date and a paper name code, from
the document data 101 and stores it in the bibliographic-items index 106; the retrieval-request
input means 107; the word frequency calculation means 108; the logic operation means 109;
the goodness-of-fit calculation means 110; the sorting information acquisition means 112; the
result rearrangement means 114; and the result display means 115.

The relative goodness-of-fit calculation means 111 which converts the goodness of fit of each
record calculated by the goodness-of-fit calculation means 110 into a value relative to the
maximum, and outputs it to the sorting information acquisition means 112.

The intermediate result cut-off means 113 which excludes, from the search results outputted
from the sorting information acquisition means 112, the records whose goodness of fit does not
fall within the specified goodness-of-fit range.

[0030]The flow chart of drawing 2 shows the procedure of the search in a 1st embodiment. The
document data 101 comprises two or more records (documents) separated by a record
delimiter, and each record comprises two or more fields separated by a field delimiter. Drawing
3 is an example of document data: the field delimiter is "^F", the record delimiter is "^R", and
the data is newspaper article data which comprises the three fields called a paper name code,
a date, and the body of story.

[0031]The word frequency information extraction means 103 scans the document data 101
beforehand, counts how many times each word registered in the dictionary 102 appears in the
body-of-story field of each record, and stores the counts in the word frequency index 104
together with the number of records in which the word appears and the total record count.
[0032]The bibliographic-items extraction means 105 scans said document data 101
beforehand, and stores the contents of the bibliographic-items field of each record in the
bibliographic-items index 106.

[0033]First, Step 201: A user inputs a retrieval-request character string with the retrieval-
request input means 107. A retrieval-request character string consists of three portions, a
search condition, a sort condition, and a goodness-of-fit range specification. Drawing 4 shows
an example of the retrieval-request character string. The portion "Matsushita AND new
product" is the search condition; it means searching for reports whose body of story contains
both of the two search terms "Matsushita" and "new product". The portion "@HIDUKE
@SHIMEI" is the sort condition; it means arranging search results newest date first and
arranging reports of the same date in ascending order of paper name code. Reports for which
both the date and the paper name code are the same are arranged in order of goodness of fit.
The portion "$70:" is the goodness-of-fit range specification; it means including in the result
only the reports whose goodness of fit, relative to the report with the maximum goodness of fit,
is 70 or more. It is also possible to specify both the minimum and the maximum of the
goodness-of-fit range, as in "$70:90", which includes in the result the reports whose goodness
of fit is 70 or more and 90 or less, or to specify only a maximum.
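
One way such a retrieval-request character string might be split into its three portions is sketched below. The tokenisation is an assumption: tokens are taken to be whitespace-separated, sort keys to start with "@", and the range to have the form "$min:max". The text does not show the syntax for a maximum-only range, so the form "$:90" accepted here is hypothetical, as are the names `Request` and `parse_request`.

```python
from typing import NamedTuple, Optional


class Request(NamedTuple):
    search_condition: str
    sort_keys: list
    fit_min: Optional[float]
    fit_max: Optional[float]


def parse_request(text: str) -> Request:
    """Split a request string into search condition, sort keys, and fit range."""
    search_tokens, sort_keys = [], []
    fit_min = fit_max = None
    for token in text.split():
        if token.startswith("@"):
            sort_keys.append(token[1:])          # e.g. "@HIDUKE" -> "HIDUKE"
        elif token.startswith("$"):
            low, _, high = token[1:].partition(":")
            fit_min = float(low) if low else None    # empty bound means unbounded
            fit_max = float(high) if high else None
        else:
            search_tokens.append(token)          # part of the search condition
    return Request(" ".join(search_tokens), sort_keys, fit_min, fit_max)


request = parse_request("Matsushita AND new product @HIDUKE @SHIMEI $70:")
```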

[0034]Step 202: For every search term contained in the search condition inputted by the
retrieval-request input means 107, the word frequency calculation means 108 (Step 203) refers
to the word frequency index 104 and obtains the number of records in whose body of story the
word appears, the internal number of each such record, the frequency of occurrence of the
word in each record, and the total record count.
[0035]Step 204: The logic operation means 109 performs the logical operation on the record
sets which the word frequency calculation means 108 outputted. Drawing 5 shows an example
of the contents of the output of the logic operation means 109 for the retrieval-request
character string shown in drawing 4, in which the set of records where both "Matsushita" and
"new product" appear is obtained.

[0036]Step 205: For all the search-results records which the logic operation means 109
outputted, (Step 206) the goodness-of-fit calculation means 110 computes the goodness of fit
of each record with the search condition, for example by Equation 1 above.
[0037]Step 207: The relative goodness-of-fit calculation means 111 converts the goodness of
fit of each record which the goodness-of-fit calculation means 110 outputted into the value
obtained by dividing it by the maximum among them and multiplying by 100.
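
The conversion to relative goodness of fit can be sketched as follows, with invented values. The function name is hypothetical, and the guard for a non-positive maximum is an assumption added so that the sketch cannot divide by zero.

```python
def relative_goodness_of_fit(fits):
    """Scale each goodness of fit so that the largest value becomes 100."""
    highest = max(fits)
    if highest <= 0:
        return [0.0 for _ in fits]  # guard against division by zero (assumption)
    return [fit / highest * 100 for fit in fits]


relative = relative_goodness_of_fit([11.0, 5.5, 8.8])
# the record with the maximum goodness of fit always maps to 100.0
```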

[0038]Step 208: With reference to the bibliographic-items index 106, the sorting information
acquisition means 112 acquires, as sorting information for each record which the relative
goodness-of-fit calculation means 111 outputted, the values of the bibliographic items
according to the sort condition inputted by the retrieval-request input means 107. Drawing 6 is
an example of the contents of the output of the sorting information acquisition means 112, in
which the values of the date and the paper name code are acquired as sorting information.
[0039]Step 209: For all the records outputted from the sorting information acquisition means
112, the intermediate result cut-off means 113 (Step 210) checks whether the goodness of fit
of the record falls within the goodness-of-fit range specification inputted from the retrieval-
request input means 107, and (Step 211) excludes the records which do not.

[0040] Drawing 7 is an example of the contents outputted from the intermediate result cut-off
means 113 when the goodness-of-fit range specification is 70 or more.
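
The cut-off step amounts to a single pass over the records with an inclusive range test, as in this sketch. The record layout (dictionaries with an "id" and a relative "fit") and the function names are hypothetical.

```python
def within_range(fit, fit_min=None, fit_max=None):
    """True when fit lies inside the inclusive range; None means unbounded."""
    if fit_min is not None and fit < fit_min:
        return False
    if fit_max is not None and fit > fit_max:
        return False
    return True


def cut_off(records, fit_min=None, fit_max=None):
    """Keep only the records whose relative goodness of fit is inside the range."""
    return [r for r in records if within_range(r["fit"], fit_min, fit_max)]


records = [{"id": 1, "fit": 100.0}, {"id": 2, "fit": 65.0}, {"id": 3, "fit": 70.0}]
kept = cut_off(records, fit_min=70)  # corresponds to a "70 or more" specification
```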
[0041]Step 212: The result rearrangement means 114 rearranges the output of the
intermediate result cut-off means 113, using the two or more bibliographic items acquired as
sorting information as sort keys; records for which the values of all the bibliographic items are
the same are arranged in descending order of goodness of fit, and the result is outputted.
Drawing 8 is an example of the contents of the output of the result rearrangement means 114.
The resulting documents are arranged newest date first and then in ascending order of paper
name code, and since the reports which were outside the specified goodness-of-fit range are
excluded, the user can find a desired document efficiently.
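
The ordering described above (newest date first, then smaller paper name code, then higher goodness of fit) can be expressed with a single composite sort key. The sketch assumes dates are stored as integers of the form YYYYMMDD and paper name codes as integers; the record layout and values are hypothetical.

```python
def rearrange(records):
    """Newest date first, then smaller paper name code, then higher goodness of fit."""
    return sorted(records, key=lambda r: (-r["date"], r["paper"], -r["fit"]))


records = [
    {"id": 1, "date": 19990105, "paper": 2, "fit": 80.0},
    {"id": 2, "date": 19990106, "paper": 1, "fit": 70.0},
    {"id": 3, "date": 19990105, "paper": 2, "fit": 95.0},
    {"id": 4, "date": 19990105, "paper": 1, "fit": 75.0},
]
ordered = [r["id"] for r in rearrange(records)]
# record 2 has the newest date; records 3 and 1 tie on date and paper code,
# so the higher goodness of fit (record 3) comes first
```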

[0042]Step 213: The result outputting means 115 formats the output of the result
rearrangement means 114 and shows it to a user.

[0043]Thus, since this document retrieval system can display the retrieved documents with
those which do not fall within the goodness-of-fit range excluded, a desired document can be
found efficiently.

[0044]When cutting off documents of the search results by goodness of fit, one conceivable
way is to first sort the search results by goodness of fit and then cut off the documents whose
goodness of fit is less than a predetermined value. However, since the number of documents in
the search results before the cut-off is large, the processing burden of that sort becomes very
heavy. In the method of this embodiment, on the other hand, each document is only checked
for whether its goodness of fit falls within the specified goodness-of-fit range, so the processing
is light compared with said sorting. Therefore, a document-retrieval result can be displayed
promptly.
[0045]The acquisition of the sorting information of Step 208 may instead be performed only for
the documents which pass the check of Step 209 (i.e., after the search results have been cut
off); in such a case, the amount of processing for acquiring sorting information can be reduced.
[0046](A 2nd embodiment) A 2nd embodiment explains a document retrieval system which
displays documents distinguished by the rank of their goodness of fit.
[0047]As shown in drawing 9, this device is provided with an intermediate result division means
913 which gives each record of the search results outputted from the sorting information
acquisition means 912 a division flag which differs depending on whether the goodness of fit of
the record falls within the specified goodness-of-fit range. Unlike a 1st embodiment, it does not
have an intermediate result cut-off means. The rest of the composition is the same as in a 1st
embodiment (drawing 1).

[0048] Drawing 10 is a flow chart which shows the procedure of the search in a 2nd
embodiment. Here, the procedure up to Step 1008 is the same as in a 1st embodiment.
[0049]Step 1009: The intermediate result division means 913 processes every record outputted 
from the sorting information acquisition means 912. Step 1010: It checks whether the goodness 
of fit of the record falls within the goodness-of-fit range specification inputted from the 
retrieval-required input means 907. Step 1011: It gives "2" as the value of the division flag 
to a record which falls outside the goodness-of-fit range. Step 1012: It gives "1" as the value 
of the division flag to a record which falls within the goodness-of-fit range. 
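Steps 1009 to 1012 can be sketched as follows. This is only an illustration under stated assumptions: the record layout (`doc_id`, `fit`), the function name, the inclusive bounds, and the handling of an unspecified bound as "no limit" are not taken from the specification.

```python
def assign_division_flags(records, fit_min=None, fit_max=None):
    """Return copies of the records with a division flag: 1 = in range, 2 = out of range."""
    flagged = []
    for rec in records:  # Step 1009: every record is processed, none is discarded
        # Step 1010: check the goodness of fit against the specified range.
        above_min = fit_min is None or rec["fit"] >= fit_min
        below_max = fit_max is None or rec["fit"] <= fit_max
        # Steps 1011 and 1012: "2" for out-of-range records, "1" for in-range records.
        flagged.append({**rec, "flag": 1 if (above_min and below_max) else 2})
    return flagged


records = [
    {"doc_id": 1, "fit": 100},
    {"doc_id": 2, "fit": 40},
    {"doc_id": 3, "fit": 85},
]
flagged = assign_division_flags(records, fit_min=80)
```

Unlike a cut-off, every input record survives here; only the flag differs.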

[0050] Drawing 11 is an example of the contents of an output of the intermediate result division 
means 913. 

[0051]When both a minimum and a maximum are specified as the goodness-of-fit range, the 
intermediate result division means 913 may further subdivide the records which fall outside 
the goodness-of-fit range, giving "2" as the value of the division flag to a record exceeding 
the maximum and "3" as the value of the division flag to a record which is less than the 
minimum. 
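The three-way subdivision can be written as a small function. This is a sketch; the function name and the treatment of the bounds themselves as in-range are assumptions.

```python
def subdivided_flag(fit, fit_min, fit_max):
    """Division flag when both bounds are given: 1 in range, 2 above max, 3 below min."""
    if fit > fit_max:
        return 2  # exceeds the maximum
    if fit < fit_min:
        return 3  # less than the minimum
    return 1      # within the range (bounds assumed inclusive)


flags = [subdivided_flag(f, 40, 80) for f in (95, 60, 10)]
```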

[0052]Step 1013: The result rearrangement means 914 rearranges the output of the intermediate 
result division means 913 in ascending order of the value of the division flag, so that the 
records within the goodness-of-fit range come first. Records with the same division-flag value 
are rearranged using the two or more bibliographic items acquired as sorting information as 
sort keys, and records whose bibliographic items all have the same values are rearranged in 
descending order of goodness of fit and outputted. 
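The three-level ordering of Step 1013 can be sketched with a stable multi-pass sort. The assumptions here are loud ones: in-range records (flag "1") are placed first, consistent with paragraph [0053]; the sort keys are taken to be date (newest first) and paper name code (smallest first); and the field names and sample values are invented for the example.

```python
from operator import itemgetter


def rearrange(records):
    """Order by division flag, then by the sort keys, then by goodness of fit."""
    # Python's sort is stable, so the least significant key is applied first.
    out = sorted(records, key=itemgetter("fit"), reverse=True)  # tie-break: fit, high first
    out = sorted(out, key=itemgetter("paper_code"))             # sort key 2: small code first
    out = sorted(out, key=itemgetter("date"), reverse=True)     # sort key 1: newest date first
    out = sorted(out, key=itemgetter("flag"))                   # in-range (flag 1) first
    return out


records = [
    {"doc_id": "A", "flag": 2, "date": "1999-05-02", "paper_code": 1, "fit": 30},
    {"doc_id": "B", "flag": 1, "date": "1999-05-01", "paper_code": 2, "fit": 95},
    {"doc_id": "C", "flag": 1, "date": "1999-05-02", "paper_code": 1, "fit": 80},
    {"doc_id": "D", "flag": 1, "date": "1999-05-02", "paper_code": 1, "fit": 90},
]
ordered_ids = [r["doc_id"] for r in rearrange(records)]
```

Here `ordered_ids` is `["D", "C", "B", "A"]`: D and C share date and paper code and are separated only by goodness of fit, B is older, and the out-of-range record A comes last even though its date is the newest.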
[0053]Drawing 12 is an example of the contents of an output of the result rearrangement means 
914. As a result, the documents are arranged in order of newest date and smallest paper name 
code, and since the articles whose goodness of fit was outside the specified range are ranked 
lower than the group of articles whose goodness of fit is within the designated range, the 
user can find a desired document efficiently. 
[0054]Step 1014: The result outputting means 915 formats the output of the result 
rearrangement means 914 and presents it to the user. 
[0055]Thus, the document retrieval system of this embodiment can classify all the retrieved 
documents into those which fall within the goodness-of-fit range and those which do not, and 
display them accordingly. Depending on the purpose of the search, the user can end the document 
retrieval after viewing only the documents classified as within the goodness-of-fit range, or, 
when not a single omission is allowed, as when searching patent documents, the user can also 
investigate in detail the documents classified as outside the goodness-of-fit range. 
[0056] 

[Effect of the Invention]As is clear from the above explanation, the document retrieval system 
and document retrieval method of this invention can remove from the search results a document 
whose goodness of fit is outside the range which the user specified, or can rank such a 
document lower. 

[0057]This avoids the problem that, when a sort condition is specified, a document with low 
goodness of fit is ranked high while a document with high goodness of fit is ranked low, and 
it becomes possible to search for a desired document efficiently. 

[0058]A suitable goodness-of-fit range can be specified easily by converting the goodness of 
fit of each document into a value relative to the maximum, and specifying the goodness-of-fit 
range in the retrieval request as a relative value. 
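The conversion to a relative value can be sketched as follows. Expressing the result as a percentage of the maximum is an assumption made for illustration, as are the function name and the sample scores.

```python
def to_relative(fits):
    """Convert raw goodness-of-fit scores to values relative to the maximum (0 to 100)."""
    if not fits:
        return {}
    top = max(fits.values())
    if top == 0:
        # Every score is zero, so there is nothing to scale against.
        return {doc: 0.0 for doc in fits}
    return {doc: 100.0 * score / top for doc, score in fits.items()}


relative = to_relative({"doc_a": 8.0, "doc_b": 4.0, "doc_c": 2.0})
in_range = [doc for doc, rel in relative.items() if rel >= 50.0]
```

With relative values, a request such as "at least 50" keeps its meaning from one query to the next, whereas a threshold on raw scores would have to be re-tuned whenever the score scale changes.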

DESCRIPTION OF DRAWINGS 

[Brief Description of the Drawings] 

[Drawing 1] The block diagram showing the composition of the document retrieval system in a 
1st embodiment of this invention, 

[Drawing 2] The flow chart showing the procedure of the retrieval processing in a 1st 
embodiment, 

[Drawing 3] The figure showing an example of document data, 

[Drawing 4] The figure showing an example of the retrieval-required character string in a 1st 
embodiment, 

[Drawing 5] The figure showing an example of the contents of an output of the logic operation 
means in a 1st embodiment, 

[Drawing 6] The figure showing an example of the contents of an output of the sorting 
information acquisition means in a 1st embodiment, 

[Drawing 7] The figure showing an example of the contents of an output of the intermediate 
result cut-off point means in a 1st embodiment, 

[Drawing 8] The figure showing an example of the contents of an output of the result 
rearrangement means in a 1st embodiment, 

[Drawing 9] The block diagram showing the composition of the document retrieval system in a 
2nd embodiment of this invention, 

[Drawing 10] The flow chart showing the procedure of the retrieval processing in a 2nd 
embodiment, 

[Drawing 11] The figure showing an example of the contents of an output of the intermediate 
result division means in a 2nd embodiment, 

[Drawing 12] The figure showing an example of the contents of an output of the result 
rearrangement means in a 2nd embodiment, 

[Drawing 13] The block diagram showing the composition of the conventional document 
retrieval system, 

[Drawing 14] The flow chart showing the procedure of the conventional retrieval processing, 
[Drawing 15] The figure showing an example of a retrieval-required character string, 
[Drawing 16] It is a figure showing an example of the contents of an output of a result 
rearrangement means. 
[Description of Notations] 

101, 901, and 1301 Document data 

102, 902, and 1302 Dictionary 

103, 903, and 1303 Word frequency information extraction means 

104, 904, and 1304 Word frequency index 

105, 905, and 1305 Bibliographic-items extraction means 

106, 906, and 1306 Bibliographic-items index 

107, 907, and 1307 Retrieval-required input means 

108, 908, and 1308 Word frequency calculation means 

109, 909, and 1309 Logic operation means 

110, 910, and 1310 Goodness-of-fit calculation means 
111 and 911 Relative goodness-of-fit calculation means 
112, 912, and 1311 Sorting-information acquisition means 
113 Intermediate result cut-off point means 

114, 914, and 1312 Result rearrangement means 

115, 915, and 1313 Result display means 
913 Intermediate result division means 
[Drawing 3] 
[Drawing 6] 
[Drawing 15] 
[Drawing 8] 
[Drawing 9] 
[Drawing 11] 
[Drawing 12] 
[Drawing 13] 
[Drawing 10] 
[Drawing 14] 